Q-Value Based Particle Swarm Optimization for Reinforcement Neuro- Fuzzy System Design
نویسندگان
چکیده
This paper proposes a combination of particle swarm optimization (PSO) and Q-value based safe reinforcement learning scheme for neuro-fuzzy systems (NFS). The proposed Q-value based particle swarm optimization (QPSO) fulfills PSO-based NFS with reinforcement learning; that is, it provides PSO-based NFS an alternative to learn optimal control policies under environments where only weak reinforcement signals are available. The reinforcement learning scheme is designed by Lyapunov principles and enjoys a number of practical benefits, including the ability of maintaining a system's state in a desired operating range and efficient learning. In the QPSO, parameters on a NFS are encoded in a particle evaluated by Q-value. The Q-value cumulates the reward received during a learning trial and is used as the fitness function for PSO evolution. During the trail, one particle is selected from the swarm; meanwhile, a corresponding NFS is built and applied to the environment with an immediate feedback reward. The applicability of QPSO is shown through simulations in single-link and double-link inverted pendulum system. KeywordsNeuro-fuzzy system, particle swarm optimization, reinforcement learning, Q-learning.
منابع مشابه
Optimization and design of Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and Fuzzy C-Means Clustering to predict the scour after bucket spillway
Additionally, if the materials at downstream of bucket spillway are erodible, the ogee spillway is likely to overturn by the time. Therefore, the prediction of the scour after bucket spillway is pretty important. In this study, the scour depths at downstream of bucket spillway are modeled using a new meta-heuristic model. This model is developed by combination of the Adaptive Neuro-Fuzzy Infere...
متن کاملOnline Control of Nonlinear Systems using Neuro-Fuzzy Design tuned with Cooperative Particle Sub-Swarms Optimization
This paper proposes a TSK-type Neuro-Fuzzy system tuned with a novel learning algorithm. The proposed algorithm used an improved version of the standard Particle Swarm Optimization algorithm, it employs several sub-swarms to explore the search space more efficiently. Each particle in a sub-swarm correct her position based on the best other positions, and the useful information is exchanged amon...
متن کاملADAPTIVE NEURO-FUZZY INFERENCE SYSTEM OPTIMIZATION USING PSO FOR PREDICTING SEDIMENT TRANSPORT IN SEWERS
The flow in sewers is a complete three phase flow (air, water and sediment). The mechanism of sediment transport in sewers is very important. In other words, the passing flow must able to wash deposited sediments and the design should be done in an economic and optimized way. In this study, the sediment transport process in sewers is simulated using a hybrid model. In other words, using the Ada...
متن کاملAdaptive Neuro-Fuzzy Control Approach Based on Particle Swarm Optimization
This paper proposes a modified particle swarm optimization algorithm (MPSO) to design adaptive neuro-fuzzy controller parameters for controlling the behavior of non-linear dynamical systems. The modification of the proposed algorithm includes adding adaptive weights to the swarm optimization algorithm, which introduces a new update. The proposed MPSO algorithm uses a minimum velocity threshold ...
متن کاملDesign of a New IPFC-Based Damping Neurocontrol for Enhancing Stability of a Power System Using Particle Swarm Optimization
The interline power flow controller (IPFC) is a concept of the FACTS controller for series compensation which can inject a voltage with controllable magnitude and phase angle among multi lines. This paper proposes a novel IPFC-Based Damping Neuro-control scheme using PSO for damping oscillations in a power system to improve power system stability. The addition of a supplementary controll...
متن کامل